Asynchronous-transition HMM

نویسندگان

  • Shigeki Matsuda
  • Mitsuru Nakai
  • Hiroshi Shimodaira
  • Shigeki Sagayama
چکیده

We propose a new class of hidden Markov model (HMM) called asynchronous-transition HMM (AT-HMM). Opposed to conventional HMMs where hidden state transition occurs simultaneously to all features, the new class of HMM allows state transitions asynchronized between individual features to better model asynchronous timings of acoustic feature changes. In this paper, we focus on a particular class of AT-HMM with sequential constraints based on a novel concept of “state tying along time”. To maximize the advantage of the new model, we also introduce feature-wise state tying technique. Speaker-dependent speech recognition experiments demonstrat.ed error reduction rates more than 30% and 50% in phoneme and isolated word recognitions, respectively, compared with conventional HMMs.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Asynchronous-transition Hmm for Acoustic Modeling

We propose a new class of hidden Markov model (HMM) which we call Asynchronous-Transition HMM (AT-HMM) to model asynchronous temporal structure of acoustic feature sequences. Conventional HMM models a sequence of feature vectors, while temporally changing patterns of acoustic features do not necessarily synchronize with each other. In this paper, AT-HMMs with and without sequential constraints ...

متن کامل

Asynchrony with trained transition probabilities improves performance in multi-band speech recognition

One of the central themes in multi-band automatic speech recognition (ASR) is to devise a strategy for recombining sub-band information. This in turn raises two questions: (1) at what phonetic unit should the recombination take place? (2) How asynchronously should the sub-bands be run? Theoretically asynchronous multi-band ASR should perform at least as well as synchronous multi-band ASR. Howev...

متن کامل

Unsupervised Modeling of Dialog Acts in Asynchronous Conversations

We present unsupervised approaches to the problem of modeling dialog acts in asynchronous conversations; i.e., conversations where participants collaborate with each other at different times. In particular, we investigate a graph-theoretic deterministic framework and two probabilistic conversation models (i.e., HMM and HMM+Mix) for modeling dialog acts in emails and forums. We train and test ou...

متن کامل

Flexible Harmonic/stochastic Modeling for Hmm-based Speech Synthesis

In this paper the preliminary results, of a new approach on speech modeling for statistical parametric HMM-based speech synthesis are presented. The proposed system is based on a flexible pitch-asynchronous harmonic/stochastic model (HSM) [1]. The speech is modeled as the superposition of two components: a harmonic component and a stochastic or aperiodic component. The fact that the synthesis m...

متن کامل

Efficient speech enhancement based on left-right HMM with state sequence detection using LRT

Since the conventional HMM (Hidden Markov Model)-based speech enhancement methods try to improve speech quality by considering all states for the state transition, hence introduce huge computational loads inappropriate to real-time implementation. In the Left-Right HMM (LR-HMM), only the current and the next states are considered for a possible state transition so to reduce the computation comp...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000